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d , Abstract 



We address the problem of estimating the Weibull tail-coefficient which is the regular vari- 
ation exponent of the inverse failure rate function. We propose a family of estimators of this 
coefficient and an associate extreme quantile estimator. Their asymptotic normality are es- 



' tablished and their asymptotic mean-square errors are compared. The results are illustrated 

on some finite sample situations. 
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1 Introduction 

Let Xi, X2, ■ ■ ■ , X n be a sequence of independent and identically distributed random variables 
with cumulative distribution function F. We denote by Xi >n < . . . < X n ^ n their associated order 
statistics. We address the problem of estimating the Weibull tail-coefficient 8 > defined when 
the distribution tail satisfies 

(A.l) f - F(x) = exp(-if(»), x > x > 0, H^(t) = M{x, H(x) >t} = t e £(t), 

where I is a slowly varying function i.e. 

£(Xx)/£(x) — > 1 as x — > 00 for all A > 0. 

The inverse cumulative hazard function H is said to be regularly varying at infinity with index 9 
and this property is denoted by G IZg, see [7j for more details on this topic. As a comparison, 



Pareto type distributions satisfy (1/(1 — F))^ 6 7£ 7 , and 7 > is the so-called extreme value 
index. Weibull tail-distributions include for instance Gamma, Gaussian and, of course, Weibull 
distributions. 

Let (k n ) be a sequence of integers such that 1 < k n < n and (T n ) be a positive sequence. We 
examine the asymptotic behavior of the following family of estimators of 9 : 



Following the ideas of [10] , an estimator of the extreme quantile x Pn can be deduced from (Q} by: 



Recall that an extreme quantile x Pn of order p n is defined by the equation 

1 - F(x Pn ) = p n , with < p n < l/n. 

The condition p n < l/n is very important in this context. It usually implies that x Pn is larger 
than the maximum observation of the sample. This necessity to extrapolate sample results to areas 
where no data are observed occurs in reliability [8], hydrology [21], finance [9],... We establish in 
Section [2] the asymptotic normality of 9 n and x Pn . The asymptotic mean-square error of some 
particular members of ([1]) are compared in Section |3] In particular, it is shown that family (fT) 
encompasses the estimator introduced in |12) and denoted by 9 n in the sequel. In this paper, the 
asymptotic normality of 9 n is obtained under weaker conditions. Furthermore, we show that other 
members of family (fT) should be preferred in some typical situations. We also quote some other 
estimators of 9 which do not belong to family (fT]): [1] [3] [TS] . We refer to Q2] for a comparison 
with §n ■ The asymptotic results are illustrated in Section [4] on finite sample situations. Proofs 
are postponed to Section [5] 

2 Asymptotic normality 

To establish the asymptotic normality of 9 n , we need a second-order condition on £: 
(A. 2) There exist p < and b(x) — > such that uniformly locally on A > 1 



with K P (X) = £uP- l du. 

It can be shown [11] that necessarily \b\ 6 1Z P . The second order parameter p < tunes the rate 
of convergence of £(Xx)/£(x) to 1. The closer p is to 0, the slower is the convergence. Condition 
(A. 2) is the cornerstone in all proofs of asymptotic normality for extreme value estimators. It is 




(1) 






when x — > 00, 



used in jTS] [T71 [5] to prove the asymptotic normality of estimators of the extreme value index 7. 
In regular noted in [13], one can choose b(x) — x£'(x)/£(x) leading to 

XB~~ X 

h{x) = F-i(l-e-)/(F-i(l-e-)) ~ (3) 
where / is the density function associated to F. 
Let us introduce the following functions : for t > and p < 0, 

H P (t) = J™ K p (l + ~)e-*dx 

and let a n — ^o(log(n/fc„))/T„ — 1. As a preliminary result, we propose an asymptotic expansion 
of (§ n - 0): 

Proposition 1 Suppose (A.l) and (A. 2) hold. If k n — > 00, k n /n — > 0, T n \og(n/k n ) — > 1 and 
ki /2 b(\og(n/k n )) -> A 6 K i/ien, 

k}J 2 {e n ~6) = 6Ca,i + ^o(log(n/fc„))C„, 2 + kl/ 2 0a n 
+ fcy 2 6(log(n/fc„))(l+ 0p (l)), 

where £ ni i and ^ nj 2 converge in distribution to a standard normal distribution. 

Similar distributional representations exist for various estimators of the extreme value index 7. 
They are used in [16] to compare the asymptotic properties of several tail index estimators. In |15j . 
a bootstrap selection of k n is derived from such a representation. It is also possible to derive bias 
reduction method as in |14j . The asymptotic normality of n is a straightforward consequence of 
Proposition [1] 

Theorem 1 Suppose (A.l) and (A. 2) hold. If k n — > 00, k n /n — > 0, T n \og{n/k n ) — > 1 and 
kn /2 b(\og(n/k n )) -4 A G K then, 

K /2 0n - - 6(log(n/fc„)) - 0a„) 4 A/"(0,# 2 ). 

Theorem [1] implies that the Asymptotic Mean Square Error (AMSE) of 9 n is given by : 

AMSE(9 n ) = (8a n + b(\og(n/k n ))) 2 + L. (4) 

fan 

It appears that all estimators of family (JTJ) share the same variance. The bias depends on two 
terms 6(log(n/fc„)) and 9a n . A good choice of T n (depending on the function b) could lead to a 
sequence a n cancelling the bias. Of course, in the general case, the function b is unknown making 
difficult the choice of a "universal" sequence T n . This is discussed in the next section. 
Clearly, the best rate of convergence in Theorem [T] is obtained by choosing A / 0. In this case, the 
expression of the intermediate sequence (k n ) is known. 



1/2 

Proposition 2 If k n — > oo, k n /n — > and k n ' 6(log(n/fc„)) — > A ^ 0, 

2 



fc " ~ Li ^ ft ) = A 2 (lo g (n))- 2 'L(log(n)), 
\b(\og(n))J 



where L is a slowly varying function. 

The "optimal" rate of convergence is thus of order (log(n))~ p , which is entirely determined by the 
second order parameter p: small values of \p\ yield slow convergence. The asymptotic normality 
of the extreme quantile estimator ^) can be deduced from Theorem [TJ 



Theorem 2 Suppose (A.l) and (A. 2) hold. If /rwO, T„log(n/fc„) ^ 1, 

1 < liminf t„ < limsupT n < oo (5) 



1 /2 

k n b(\og(n/k n )) —> and 



then, 

logr„ \x Pn J 

3 Comparison of some estimators 

First, we propose some choices of the sequence (T n ) leading to different estimators of the Weibull 
tail-coefficient. Their asymptotic distributions are provided, and their AMSE are compared. 

3.1 Some examples of estimators 

- The natural choice is clearly to take 

T„ = T« =: Mo (log(n/fc n )), 
in order to cancel the bias term a n . This choice leads to a new estimator of denned by : 

k 

^ = Mn,k n) ) V n XWn-^n) - log(* m - fc , +1 ,„)). 

Remarking that 

p p (t) = e * / e- tu u p - l du 



Ji 

provides a simple computation method for fi$ (login /k n )) using the Exponential Integral (EI), see 
for instance p], Chapter 5, pages 225-233. 

- Girard [12] proposes the following estimator of the Weibull tail-coefficient: 

k n I k„ 

§™ = g(l g(X„_ i+1 , n ) - log(X„_ fe „ +1 , n )) / £(log 2 (n/i) - log 2 (n/k n )) , 

i=l / i=l 

where log 2 (x) — log(log(x)), x > 1. Here, we have 

k 



T =T (2) = -^Vlo K 



(2) 

It is interesting to remark that T n is a Riemann's sum approximation of /io(log(n/fc n )) since an 
integration by parts yields: 

/*(t)= Aogfl-^W 



(i 



Finally, choosing T„ as the asymptotic equivalent of no(\og(n/k n )), 



T n = T<® =: l/log(n/fc„) 

leads to the estimator : 

~( 3) = log(n/fc n ) j^ {log(Xn _ i+i n) _ logiXn _ kn+hn) y 

Kn , 

2 — 1 

For i = 1, 2, 3, let us denote by Xp2 the extreme quantile estimator built on 0^ by ([2]). Asymptotic 
normality of these estimators is derived from Theorem[T]and Theorem^ To this end, we introduce 
the following conditions: 

(C.l) k n /n^Q, 

(C.2) log(fc n )/log(n)^0, 

(C.3) kn/n ->• and k n /2 / log(n/fc„) -> 0. 

Our result is the following: 

1/2 

Corollary 1 Suppose (A.l) and (A. 2) hold, k n — >• oo and kn b(log(n/k n )) -> 0. For i = 1,2,3: 
// (C.i) /io/d t/ien 

ii) If (C.i) and (0) /ioZd 7 </ien 



A: 



1/2 /Ai) 



logr„ yx. 

In view of this corollary, the asymptotic normality of n ^ is obtained under weaker conditions than 
6n and n , since (C.2) implies (C.l). Let us also highlight that the asymptotic distribution of 

A f2) 1/2 

n is obtained under less assumptions than in [12], Theorem 2, the condition k n ' / log(n/fc„) — > 

1/2 

being not necessary here. Finally, note that, if 6 is not ultimately zero, condition k n b(\og(n/k n )) — > 
implies (C.2) (see Lemma [TJ. 



3.2 Comparison of the AMSE of the estimators 

We use the expression of the AMSE given in (Q} to compare the estimators proposed previously. 



1 /2 

Theorem 3 Suppose (A.l) and (A. 2) hold, k n — > oo, log(fe n )/log(n.) — > andk n ' b(\og(n/k„)) — > 
A€R. Several situations are possible: 

k 

i) b is ultimately non-positive. Let us introduce a = —4 lim fr(logn)- ; — G [0,+col. 

'Woo log k n 

If a > 9, then, for n large enough, 

AMSE0W) < AMSE(§W) < AMSE0W). 
If a < 9, then, for n large enough, 

AMSE0W) < mw(AMSE{§W),AMSE(§W)). 

ii) b is ultimately non-negative. Let us introduce (5 — 2 lim xb(x) £ [0, +oo]. 

x— >oo 

If ft > 9 then, for n large enough, 

AMSE{6$) < AMSE{9^) < AMSE0W). 
If ft < 6 then, for n large enough, 

AMSE0W) < min(AMSE0W),AMSE(§W)). 

It appears that, when b is ultimately non- negative (case ii)), the conclusion does not depend on the 
sequence (k n ). The relative performances of the estimators is entirely determined by the nature 
of the distribution: §n has the best behavior, in terms of AMSE, for distributions close to the 
Weibull distribution (small b and thus, small ft). At the opposite, 9^ should be preferred for 
distributions far from the Weibull distribution. 

The case when b is ultimately non-positive (case i)) is different. The value of a depends on k n , 
and thus, for any distribution, one can obtain a — by choosing small values of fc„(for instance 
k„ = — l/6(log n)) as well as a = +oo by choosing large values of k n (for instance k n = (l/6(log n)) 2 
as in Proposition^. 

4 Numerical experiments 

4.1 Examples of Weibull tail-distributions 

Let us give some examples of distributions satisfying assumptions (A.l) and (A. 2). 

Absolute Gaussian distribution |A/"(/x, <r 2 )|, a > 0. From 9 , Table 3.4.4, we have H^{x) = 
x e t{x), where 9=1/2 and an asymptotic expansion of the slowly varying function is given by: 



Therefore p = — 1 and b(x) — log(a;)/(4a;) +0(l/x). b is ultimately positive, which corresponds to 
case ii) of Theorem [3] with /3 — +00. Therefore, one always has, for n large enough: 

AMSE0W) < AMSE0W) < AMSE(9™). (6) 

Gamma distribution T(a, A), a, A > 0. We use the following parameterization of the density 

f(x) = -^x a - 1 exp(-\x). 
T(a) 

From [5], Table 3.4.4, we obtain H*~(x) = x 9 £(x) with 9 = 1 and 

«, s 1 a - 1 logs , 

= T + ^ — !h + 0(VaO- 

A A a; 

We thus have p = — 1 and b(ar) = (1 — a)log(x)/x + 0(1/ x). If a > 1, b is ultimately negative, 
corresponding to case i) of Theorem [31 The conclusion depends on the value of k n as explained 
in the preceding section. If a < 1, b is ultimately positive, corresponding to case ii) of Theorem [3] 
with f3 = +00. Therefore, we are in situation @. 

Weibull distribution W(a, A), a, A > 0. The inverse failure rate function is H i ~(x) = Ax 1 /", 
and then = 1/a, £(x) = A for all x > 0. Therefore 6(x) = and we use the usual convention 
p = — 00. One may apply either i) or ii) of Theorem [3] with a = (3 — to get for n large enough, 

AMSE0W) < min( AM S E (0^ '), AM SE (9^)). (7) 
4.2 Numerical results 

The finite sample performance of the estimators 9n \ #i 2 ' and 9^ are investigated on 5 different 
distributions: T(0.5,l), r(1.5,l), \Af(0,l)\, W(2.5, 2.5) and W(0.4,0.4). In each case, N = 200 
samples (X n ,i)i=i....,N of size n = 500 were simulated. On each sample (X n> i), the estimates 9^\(k), 

"(3) 

9 n i (k) and 9 n [(k) are computed for k = 2, ... , 150. Finally, the associated Mean Square Error 
(MSE) plots are built by plotting the points 

N 



if £(«?«(*)-' 



2 



j = 1, 2, 3. 



i=l 

They are compared to the AMSE plots (see (g|) for the definition of the AMSE): 

k, (9a® + b(\og(n/k))) 2 + j} 3 = 1, 2, 3, 

and where b is given by ([3]). It appears on FigureQ]- Figure [5] that, for all the above mentioned dis- 
tributions, the MSE and AMSE have a similar qualitative behavior. Figure[T]and Figure [2] illustrate 
situation ([6]) corresponding to ultimately positive bias functions. The case of an ultimately nega- 
tive bias function is presented on Figure [3] with the r(1.5, 1) distribution. It clearly appears that 



the MSE associated to #i 3) is the largest. For small values of k, one has MSE(§ft) < MSE(ffi) 
and MSE(§n^) > MSE(§n^) for large value of k. This phenomenon is the illustration of the 
asymptotic result presented in Theorem O). Finally, Figured] and Figure [5] illustrate situation (0 
of asymptotically null bias functions. Note that, the MSE of 9n^ and Qffl are very similar. As a 
conclusion, it appears that, in all situations, §n ' and share a similar behavior, with a small 
advantage to §„ . They provide good results for null and negative bias functions. At the opposite, 
6n should be preferred for positive bias functions. 

5 Proofs 

For the sake of simplicity, in the following, we note k for k n . We first give some preliminary 
lemmas. Their proofs are postponed to the appendix. 

5.1 Preliminary lemmas 

We first quote a technical lemma. 

Lemma 1 Suppose that b is ultimately non-zero. If k oo, k/n — > and fc 1 / 2 &(log(n/fe)) — > A e 
M, then log(fc)/ log(ra) -> 0. 

The following two lemmas are of analytical nature. They provide first-order expansions which will 
reveal useful in the sequel. 

Lemma 2 For all p < and q € N* , we have 

Kl { 1 + -) e~ x dx ~ — as t -> oo. 
p \ t ) ifi 

Let a { n ] = p Q {\og{n/k n ))/T^ - 1, for i = 1,2,3. 
Lemma 3 Suppose k —> oo and k/n 0. 
i) T r [ 1] \og(n/k) -> 1 and aiP = 0. 

(2) (2) 

ii) Tn \og(n/k) — > 1. // moreover log(fc)/ log(n) — > then ~ log(fc)/(2fe). 
mj T^ 3 - 1 log («/£;) = 1 and ai? ~ — l/log(n/fc). 
The next lemma presents an expansion of 

Lemma 4 Suppose /c — > oo cmrf /c/n — >■ 0. Under (A.l) and (A. 2), £/ie following expansions hold: 

K = { 0U i O) + 6(Iog(n/fc))trW(l + OP (l))) , 

w/iere 



and where E n -k+i,n is the (n — k + l)th order statistics associated to n independent standard 
exponential variables and {Fx, . . . , F k -\} are independent standard exponential variables and inde- 
pendent from E n —k+l,n- 

The next two lemmas provide the key results for establishing the asymptotic distribution of 9 n . 
Their describe they asymptotic behavior of the random terms appearing in Lemma 0J 

Lemma 5 Suppose k — > oo and k/n — > 0. Then, for all p < 0, 

p p 1 

pp(E n -k+l,n) ~ f7 p(-E'ri-fc+l,Ti) 



log(ra/fc) ' 

Lemma 6 Suppose k — > oo and k/n — > 0. Then, for all p < 0, 

(Ui p) - fi p (E n _ k+1<n )) 4 Af(0, 1). 



Cp(^n-fc+l 

5.2 Proofs of the main results 

Proof of Proposition [1] — Lemma [5] states that for p < 0, 



{uv> - f i p (E n _ k+1>n )) = Up), 



where £ n (p) -^"(Oj 1) f° r P < 0. Then, by Lemma 0] 



+ k ^b{\o g (n/k)) ( gp( \ fc+1,mK ;ff + Mp( \ fc+1 -" } ) (1 + 0P (i)). 

Since T„ ~ 1/ log(n/fc) and from Lemma [5l we have 

k^(9 n -9) = 9U,i + k x 'H - l) + fc 1/2 6(logKfc))(l + o P (l)), 

where -4 7V(0, 1). Moreover, a first-order expansion of jXo yields 

- 1 + (E n - k+ i >n - log(n/fe)j- 



(8) 



Mo(log(n/fc)) Mo (log(n/fc))' 
where r\ n e] min(^;„_A ;+ i, n , log(n//c)), max(^ n _ fc+ i, n , log(n/fc))[ and 



7 /*00 7 /*00 



Since for t > T > 0, f(-,t) is integrable, continuous and 

0/(3, t) 



6>f 



£ x\~ L __ _ e 



we have that 



Then, Lebesgue Theorem implies that /Zq 1 \t) ~ — 1/t 2 as t — > oo. Therefore, is regularly 
varying at infinity and thus 

A^V) p y { o \log(n/k)) ^ 1 
/i (log(n/fc)) /Lt (log(n/fe)) \og(n/k)' 

Since fc 1 ' 2 (£n— fc+i,n — log(n/fc)) 4.^(0,1) (see [12], Lemma 1), we have 

Mo (log(n/fc)) log(n/fc) U ' 2 ' {J) 

where £ n ,2 4 A/"(0, 1). Collecting (|SJ), (|9|) and taking into account that T n log(n/fc) — > 1 concludes 
the proof. ■ 

Proof of Proposition [2] Lemma [1] entails log(n/fc) ~ log(n). Since |6| is a regularly varying 
function, 6(log(n/fc)) ~ 6(log(n)) and thus, k 1 / 2 ~ A/6(log(n)). ■ 

Proof of Theorem [2] — The asymptotic normality of be deduced from the asymptotic 

normality of 8 n using Theorem 2.3 of 10 . We are in the situation, denoted by (S.2) in the 
above mentioned paper, where the limit distribution of x Pn /x Pn is driven by 9 n . Following, the 

1/2 ' 

notations of |10j . we denote by a n = k n the asymptotic rate of convergence of n , by /3 n = 9a n 
its asymptotic bias, and by C = Af(0, 2 ) its asymptotic distribution. It suffices to verify that 

log(r„) \og{n/k) -> oo. (10) 

To this end, note that conditions <j5j) and p n < 1/n imply that there exists < c < 1 such that 

/ \ , „ / log(n) \ log(fc) 
log(T») > c (t„ - 1) > c - °\ ' - 1 = c- 



log(n/fe) / \og(n/k)' 
which proves (fTU|) . We thus have 



^ 1 ^ 2 ^-0a n ( x P n 

-T — T. 



M{Q,0 2 



log T n " \X Pn 

Now, remarking that, from Lemma[2J /i (log(n/fc)) ~ l/log(n/fc) ~ T n , and thus a n — > gives the 
result. ■ 

Proof of Corollary [T] — Lemma [3] shows that the assumptions of Theorem [T] and Theorem [2] are 
verified and that, for i — 1,2, 3, fc 1 / 2 a^ ) ->• 0. ■ 

Proof of Theorem [3] 

i) First, from Q and Lemma [3] iii) , since b is ultimately non-positive, 

AMSE(§iP) - AMSE(§&) = -9(a^) 2 (e + 2 6(log( J ) /fc)) ^ < . (11) 
Second, from (Q}, 

AMSi^f ) - AMSE0W) = 0(4 2) ) 2 ^ + 2 b(1 ° g |" ) /fc)) ) . (12) 



If b is ultimately non-zero, Lemma [T] entails that log(n/fe) ~ log(n) and consequently, since |6| is 
regularly varying, 6(log(n/fc)) ~ 6(log(n)). Thus, from Lemma[3]ii), 

b(loe(n/k)) ,,,, . k , . 

2± ^~ 46(bg ">M*r- Q - (13) 

Collecting (fl"l"]) - (fl"3T) concludes the proof of i). 
ii) First, (fT2f and Lemma [3] ii) yields 

AMSE0W) - AMSE0M) > 0, (14) 

since b is ultimately non-negative. Second, if b is ultimately non-zero, Lemma [T] entails that 
log(n/fc) ~ log(n) and consequently, since |6| is regularly varying, b(\og(n/kj) ~ b(log(n)). Thus, 
observe that in ([TT]) . 

2 6(1 °y )) ~-26(lo g n)(lo g n)^-/3. (15) 
a„ 

Collecting (fTTj) . (fT4|) and (fT5|) concludes the proof of ii). The case when b is ultimately zero is 
obtained either by considering a — in (fT5|) . or = in (flT 
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Appendix: proof of lemmas 

Proof of Lemma [l] Remark that, for n large enough, 

\k^ 2 b(\og(n/k))\ < \k l ^b{\og{n/k)) - A| + |A| < 1 + |A|, 

and thus, if b is ultimately non-zero, 

o ^ 1 log(fc) < log(l + |A|) log|fe(log(n/fc))| 
~ 21og(n/fc) ~ log(n/fc) log(n/fe) ' 

Since |6| is a regularly varying function, we have that (see [7], Proposition 1.3.6.) 

log|b(logQr))| 

: — — > as x — > oo. 

\og(x) 

Then, ([TB")) implies log(fc)/ log(n/fc) — > which entails log(fc)/ log(n) — > 0. 

Proof of Lemma [2] Since for all x, t > 0, tK p (l + x/t) < x, Lebesgue Theorem implies that 

&s, i °° K ( x + f) ) q c ~ Xdx = f K i 1 + 1 ) ) 9 c ~ Xdx = r xqe ~ Xdx = ql ' 

which concludes the proof. 
Proof of Lemma [3] 

i) Lemma [2] shows that /xo(i) ~ 1/t and thus Tn log(n/fc) — > 1. By definition, an = 0. 



ii) The well-known inequality —x 2 /2 < log(l + x) — x < 0, x > yields 
III 1 1 



~ £ log 2 (fc/*) < log(n/fc)Ti 2 ) - i £ log(fe/i) < 0. (h 



2 log(n/fc) fc ^ ~° v "' v " "~ ov "' A; 
Now, since when k — > oo, 

- ^log 2 (fc/i) ->■ / log 2 (x)<fc = 2 and - ^log(fc/i) -> - / log(a:)da; = 1, 
k i=i ^ fc i=i 

it follows that T n ' \og(n/k) -)• 1. Let us now introduce the function defined on (0, 1] by: 

log(z) 



fn(x) = log 1 

V log(n//c) 

We have: 



T, 



^(T^ - Mo( io g (n/fc))) = - * - f urn 

i /-(*+i)/fe i /•!/«= 



J n i = i J i/k In J 

Since 

fn(t) = fn (*/*) + (* - (*/*) + / (* - tffP 

Ji/k 



where is the pth derivative of /„, we have: 
fe- 1 c(*+i)/fc 

i/k 



i fe-l r(i+l)/k rt i /-1/fc 

+ ^)E / / (t-x)ti 2) (x)dxdt + —- f n (t)dt=:9 1 + 9 2 + 9 3 . 



Let us focus first on the term 

1 n i=l 



1 ■/l tf,( "* + ^(i§ tf,( ' / * ) -/l tf ' (x) * 



2kT ( n 2) 2kTk ' ~{ Ji/k 

Since ~ l/log(n/fc) and log(fc)/ log(n) — > 0, we have: 

Furthermore, since, for n large enough, /l 2) (x) > for x G [0,1], 
fe-i r(*+i)/fc 



O < *1 2 < 

2fcT, 



-(5) E / + !)/ fc ) - fPa/Wdx = — -pj - 



2fc 2 ri 2) \ lo g(«-/ fc ) log(n/fc) V log(n/fc)/ y 2fc 
Thus, 

* 1 = _i2|M (1 + o(1)) . (18) 

Second, let us focus on the term \I/2 - Since, for n large enough, fn\x) > for ir G [0, 1], 

0<*2 < — /. (t-i/k)fW( X )dxdt 

Finally, 

T? (2) 7 log(n/fc) T? (2) 7 V log(n/fc) / 

and we have: 

Furthermore, using the well known inequality: | log(l + x) — x\ < x 2 /2, x > 0, we have: 

(iog(fc)) 2 j\o g {ky 



2fclog(n/fc) V fc 



since log(fe)/ log(n) — > 0. Thus, 



We conclude the proof of i) by collecting (fT8|) - (f20| . 

ii) First, log(n/fc) = 1 by definition. Besides, we have 

4 3) = ^^-l = log(n/^o(log(n/ fc ))-l 



*3 = ^(1 + 0(1)). (20) 



^ 00 logK fc )log(l + I ^ y )e-^-l 



1 1 f°° x 2 1 



where 



R " = l log(n/fc) f log + 



log(n/fc) J log(n/fc) 2(log(n/fc)) 2 , 

Using the well known inequality: | log(l + x) — x + x 2 /2| < x 3 /3, x > 0, we have, 

1 f 00 x 3 _ ( 1 

l-RJ < r / « ; — tttttt e x dx = o 



3 Jo (i°g(«y fc )) 2 v io g( n / fc )/ 

which finally yields off ~ — l/log(n/fc). ■ 

Proof of Lemma [4] Recall that 

1 1 fc_1 

On =■ — -^(log(X„_ l+ i^) - log(X n _ fe+1 , n )), 
n i—1 

and let -Ei,n, ■ • ■ , £n,n be ordered statistics generated by n independent standard exponential ran- 
dom variables. Under (A.l), we have 

^2(log H^(E n - i+li n) -log H^(E n - k +X,n)) 



i—i 



\ l—l X 2—1 



lot 



Define x n = B„-fc+i,n an d Aj,„ = E n —i+± tn / E n -k+i lTl . It is clear, in view of |12j . Lemma 1 that 
x„ — > oo and Ai jM — » 1. Thus, (A. 2) yields that uniformly in i = 1, . . . , k — 1: 



» = ^ ( 4 £ l0 § f IM^) + (i + » P (i))^»- fc+ i,iE R P ( E " 



n—i+l.n 



™ \ i=l V-^n— fc+l,n/ K i=1 \ £ 'n-fc+l,ri 

The Renyi representation of the Exp(l) ordered statistics (see [2J, p. 72) yields 

E n -i+l,n 1 (i [, -Pfe-i,fe-l 



I i^-Ff^ ' (21) 

where {i^i fc-i, . . . , Fk-i t k-i} are ordered statistics independent from E n -k+i.n and generated by 
k — 1 independent standard exponential variables {-F\, . . . , -Ffc-i}- Therefore, 



(1 + o p (l))&(£„_ fc+1 ,„)~ K p ( 1 



Remarking that Kq(x) = log(a;) concludes the proof. 
Proof of Lemma \E\ — Lemma [2] implies that, 

, p 1 p 1 



fJ- P (E n . 



■k+l,n) 



E n -k+i,n log(n/fc)' 



since -E n -fc+i,n/log(n/A;) — > 1 (see [T2], Lemma 1). Next, from Lemma[2J 

a 2 p {E n ^ +1 , n ) = -g-? (1 + 0P (1))_ * (l + op(l)) 

= 5^ (1 + op(1) ^WF (1 + op(1,) ' 

which concludes the proof. ■ 
Proof of Lemma [6] Remark that 

jul/2 . . fc^ 1 / 2 fe_1 / / F- \ \ 

— -(C/W-/x p (£J n _ fc+1 , n ) = — J£IkJi + —* -M P (^-*+i,n) 

0>(.-E'n-fc+l,nJ v 7 <J p (£j n -k+l,n) ~[ \ \ £*n—k+l,n J / 

_ j -l/2 f 1 p(En-k+l,n) 

Let us introduce the following notation: 
Thus, 

ttt" \ (^ P) - » P (E n - k+ l,n)) = 5„(£7 n _ fc+ i, n )(l + o(l)) + o P (l), 

from Lemma [5l It remains to prove that for i£l, 

P(S' n (£ , n _ fc+ i,„) <x)- $(x) -> as n -> oo, 

where $ is the cumulative distribution function of the standard Gaussian distribution. Lemma [5] 
implies that for all e €]0, 1[, there exists T £ such that for all t > T £ , 

|(l-,)<E((^(l + f)) 9 )<|(l + £) . (22) 

Furthermore, for 

P(S n (E n _ k+1>n ) <x)- $(x) = / E (P(S n (t) <x)- $(x))h n (t)dt 



+ / (P(S n (t) < x) — <&(x))h n (t)dt —: A n + B n , 



where h n is the density of the random variable E n -k+i,n- First, let us focus on the term A n . We 
have, 

\A n \ < 2P(E n _ k+1 , n <T E ). 



p 

Since E n -k+i, n /^og(n/k) — > 1 (see [12] . Lemma 1), it is easy to show that A n —> 0. Now, let us 
consider the term B n . For the sake of simplicity, let us denote: 

Yi = K p (l + ?^j- n P {t), i = 1, . . . , k - 1 

Clearly, Y%, . . . , Yfc-i are independent, identically distributed and centered random variables. Fur- 
thermore, for t > T e , 

3\ 



E(\Y X \ 3 ) < E + 



+ 3E (V p (l + (^(i)) 2 
< 4Ci(g,e) < oo, 

from (22) where Ci(q,e) is a constant independent of t. Thus, from Esseen's inequality (see 
Theorem 3), we have: 

sup \P(S n (t) <x)- S(a:)| < C 2 L„, 



where C2 is a positive constant and 



From J22l), since £ > XI 



(a p (t)f=E\^K p (l+~-f)) j-^\K p \l + ^-jj j >^C 3 (s), 

where Cz(e) is a constant independent of t. Thus, i„ < (k — l) _1 / 2 C4(g, e) where 6*4(9, e) is a 
constant independent of t, and therefore 

|S„| < C 4 (q,e)(k - lJ-Vap^ > T £ ) < C 4 (q,e)(k - I)- 1 ' 2 -> 0, 



which concludes the proof. 



mean 




